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Since the discovery of short-period exoplanets a decade ago, photometric surveys 
have been recognized as a feasible method to detect transiting hot Jupiters. Many 
transit surveys are now under way, with instruments ranging from 10-cm cameras to 
the Hubble Space Telescope. However, the results of these surveys have been much 
below the expected capacity, estimated in the dozens of detections per year. 
' One of the reasons is the presence of systematics ("red noise") in photometric 

, time series. In general, yield predictions assume uncorrelated noise ("white noise"). 

QO ■ In this paper, we show that the effect of red noise on the detection threshold and the 

CO ', expected yields cannot be neglected in typical ground-based surveys. We develop a 

. simple method to determine the effect of red noise on photometric planetary transit 

detections. This method can be applied to determine detection thresholds for transit 
^ (— i surveys. We show that the detection threshold in the presence of systematics can be 

O much higher than with the assumption of white noise, and obeys a different dependence 

I 1 on magnitude, orbital period and the parameters of the survey. Our method can also 

be used to estimate the significance level of a planetary transit candidate (to select 
promising candidates for spectroscopic follow-up). 

We apply our method to the OGLE planetary transit search, and show that it 
provides a reliable description of the actual detection threshold with real correlated 
noise. We point out in what way the presence of red noise could be at least partly 
responsible for the dearth of transiting planet detections from existing surveys, and ex- 
amine some possible adaptations in survey planning and strategy. Finally, we estimate 
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the photometric stability necessary to the detection of transiting "hot Neptunes" . 

Key words: planetary systems - methods: data analysis - methods: statistical 
techniques: photometric - surveys 



1 INTRODUCTION 

Many photometric surveys for transiting exoplanets are now 
under way, with a wide variety of instrumentation and ob- 
servation strategies - from monitoring large areas of the sky 
with small telescopes to deeper surveys on star clusters or 
the Galactic disc with l-4m telescopes 1 . Altogether, thou- 
sands of telescope nights have been invested in these surveys, 
monitoring hundreds of thousands of target stars in the So- 
lar neighbourhood and in the Galactic disc. However, even 
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after years of operation, the results of these surveys failed to 
meet the expectations, with only a slow trickle of detections 
instead of the expected bounty. 

Part of the mismatch between expectations and actual 
performance can be attributed to the fact that these sur- 
veys were often assuming that the first known transiting 
extrasolar planet and for long the only one, HD2094586 
JCharbonneau et alJ boOO). was typical of hot Jupiters. Its 
radius of ~ 1.3 Rj creates a 2% transit signal on a solar-type 
star. Subsequently discovered transiting gas giants showed 
the large radius of HP 20945 8 to be an excepti on rather than 
the rule dAlonso et alJl2004 IPont et alJl2004l) . The mode of 
hot J upiter radii is probably near 1.1 Rj or lower jGaudil 
2005), producing correspondingly shallower eclipses. 

Still, even assuming smaller gas giants, transit surveys 
keep predicting many more detections than they actually 
yield. Understanding this mismatch is essential both for in- 
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terpreting the results in terms of planetary statistical prop- 
erties, and in order to improve the planning and strategy 
of the surveys. Indeed, if the statistical properties of hot 
Jupiters were not known from radial velocity surveys, it is 
very likely that drastically different conclusions would have 
been drawn from the results of the transit surveys. Upper 
limits would have been put to the abundance of hot Jupiters 
one or two orders of magnitude below those derived from 
Doppler surveys. 

The ingredients in the prediction simulations seem solid: 
the abundance of hot Jupiters is relatively well known from 
radial velocity surveys, the statistics of targets are obtained 
from well-tested models of Galactic stellar populations, and 
the rest comes from simple orbital mechanics. The setting of 
the transit detection threshold is often considered a minor 
component in the simulations. The threshold is generally 
modelled as a minimum signal-to-noise ratio ("S/N") of the 
transit detections assuming uncorrelated noise in the pho- 
tometric data. 

In this paper we show that, contrary to these assump- 
tions, the correlation of photometric data at the millimag- 
nitude level cannot be neglected when determining the de- 
tectability of planetary transits, and that taking the correla- 
tion into account can strongly affect the detection threshold, 
and consequently the estimates of the potential of photomet- 
ric surveys in terms of planet detetion. 

We develop a simple method to assess the significance 
of detected transit candidates in the presence of such "red 
noise" . We propose a method that is robust and gives real- 
istic results, while at the same time remains simple to use 
and to apply to any ground-based transit survey. 

Ongoing transit surveys have shown that detecting 
transiting signals was not the end of the story. For transit 
depths in the range of a few percents, by far the largest num- 
ber of detections are due to eclipsing binar ies, either small 
transiting M dwarfs (e.g. iPont et al1l2005l) or eclipsing bi- 
naries diluted by the lig ht of an unresolved companion (e.g. 
iMandushev et all |2005T> . The identification of true transit- 
ing planets among all identified transit candidates requires 
a considerable investment in spectroscopic follow-up obser- 
vations. For transit depths below ~ 2% , the odds be come 
more favourable to transiting planets ferownl [2003) , but 
since such transit depths are near the detection threshold 
of most ground-based surveys, false positive detections start 
being a source of contamination. False positives require even 
more follow-up observations than eclipsing binaries. There- 
fore the assignment of reliable significance levels to transit 
detections near the detection threshold, is another motiva- 
tion for a robust method to assess the significance of transit 
candidates in the presence of systematics or "red noise" . 

A third motivation is to derive realistic estimates for the 
uncertainty on planetary parameters derived from transit 
lightcurves, taking into account the effect of systematics in 
the photometry. 

In Section[2]we present our method to compute the sig- 
nificance level of a transit detection. In Section 3 we examine 
the implications for transit surveys. Section 4 summarizes 
the results of this paper and points out some interesting 
consequences. 
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Figure 1. Lightcurve for a planetary transit candidate, with the 
step function used in the detection procedure (OGLE-TR-132, 
lUdalski et al.l2003t) . Top: short excerpt of the lightcurve. Bottom: 
phase-folded lightcurve around the detected transit. 



2 MODIFIED DETECTION STATISTIC FOR 
RED NOISE 

2.1 White- noise statistics 

The signal produced in a stellar lightcurve by a planetary 
transit can be approximated by a strictly periodic step func- 
tion (Fig. , with a depth related to the radius ratio of the 
two bodies, and a duration related to the orbital elements 
and the primary radius. Transit detection algorithms usually 
work by fitting a step function to the phase-folded signal, or 
by detecting a step-like decrease and increase in the flux. 

Let us assume that a possible transit signal has been 
detected in a lightcurve consisting of N flux measurements 
fi, with uncertainties a. The flux is normalised so that the 
mean flux outside the transit signal is 1, < f° ut >= 1 . 

For simplicity we first assume that the measurement un- 
certainties are equal for all data points, Oi = ctq fSection |'2.11l 
discusses the generalisation to unequal uncertainties). Let d 
be the best-fitting transit depth, and n the number of data 
points in the transit, d will then be the difference between 
the mean of the data points in the transit and the flux level 
outside the transit: 



d=l-< f" >- 



Eft 



The uncertainty on d is the error on the mean of f" 1 
(since n « N for planetary transits, we neglect the error 
on < f° ut >). Using the expression for the uncertainty on 
the mean under the assumption of uncorrelated noise gives: 



ffd = <r{< f- n >) = (70 



(1) 



The uncertainty on d decreases with the square root of the 
number of points in the transit. 

Intuitively, we can interpret d as the "signal" in the 
transit detection procedure, and ad as the noise. A natural 
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Figure 2. Lightcurves with white noise only (top), red noise only 
(middle) and white and red noise (bottom). Typical light curves 
from high-precision rapid time-series photometry for bright tar- 
gets in transit surveys resemble portions of the bottom curve. 



statistic for the significance of a transit detection is thus the 
S/N ratio of d, which we write Sd- 



Sd = d/dd 



d 

— ? 



,V2 



(2) 



This expression is the one most commonly used to evaluate 
the significance of transit detections as well as to predict 
detection thresholds from the characteristics of the observa- 
tions. Statisticians sometimes refer to Sd as the 'Wald test' 
statistic. 



2.2 Correlated residuals and coloured noise 

The derivation of Eq. [5] requires a crucial assumption: the 
photometric errors were assumed to be uncorrelated, i.e. 
purely white noise was assumed. In fact, the errors on 
ground-based millimagnitude photometry in rapid time se- 
ries are correlated. Trends, or "systematics" are present, re- 
lated to changing airmass, atmospheric conditions, telescope 
tracking, Hatfield errors (usually a combination of several 
of these factors). These effects introduce some covariance 
between the lightcurve data points, with timescales similar 
to the duration of planetary transits. Typical transits for 
close-in planets last 2-3 hours, which is also the timescale of 
airmass, seeing and tracking changes in ground-based pho- 
tometric observations. In the parlance of signal analysis, the 
noise on photometric observations is quite red, with a low- 
frequency component. Figure [5] illustrates the difference be- 
tween white noise and noise with a red component. Figure [3] 
gives a real-life example of a lightcurve from a planetary 
transit search. Red noise is apparent at the millimagnitude 
level in this time series. 



Figure 3. Example of a real lightcurve from a planetary transit 
survey (the OGLE survey). The intervals between different nights 
were compressed for the display. The vertical range of the plot is 
0.03 mag. Systematic trends are visible on several timescales to 
the level of a few millimagnitudes. 



2.3 Transit depth uncertainty with covariance 

The equivalent of Eq. in the presence of correlated noise 
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n, 2 n ^ n 2 



(3) 



where dj are the covariance coefficients between the i 
and j th measurements, where the i and j indexes cover the 
measurements taken during the transit. The diagonal of the 
matrix C contains the individual errors of (random uncer- 
tainty on the i th measurement), and in Eq. [3] they are all 
assumed equal to cjq. 

The first part of this expression is Eq.Q Equation is 
therefore a valid approximation only when: 
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>> 



But in fact, in many cases relevant to planetary transit sur- 
veys, this inequality is not satisfied. Quite the contrary, the 
covariance term may even be larger than the white-noise 
term: 



1 



n 2 ^ n 

so that the full expression in Eq.EJshould be used to estimate 
the uncertainty on the transit depth. 

Thus Eq.|2]is no longer valid in the presence of red noise. 
The uncertainty on the mean does not decrease as n 1 ^ 2 . Ob- 
viously, if neighbouring points are correlated, having more 
points during the transit does not increase the detection 
statistics as much as if they were uncorrelated. 

This can have a drastic effect on the transit detection 
threshold, as illustrated in Fig |5] The top curve is drawn 
with uncorrelated, normally distributed residuals, the mid- 
dle curve with a red T// noise', the bottom curve with a 
composite noise. All three curves have identical dispersions. 
Curves like the bottom panel of Fig. [5] will look familiar 
to observers in the field, while curves like the top panel will 
look either like the result of an incredibly good night or mea- 
surements on rather faint objects for which all other sources 
of noise are dominated by the statistical photon noise, so 
that the residuals are uncorrelated and Gaussian. The mid- 
dle and bottom curves are much more likely to produce false 
transit detections (and to hide a real transit signal) than the 
top curve. 

Let us assume a transit duration lasting 50 times the 
interval between two points. The dispersion of the mean of 
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50 consecutive points in the top curve is 0.14 (= l/\/50). 
In the middle curve, this dispersion is 0.40. Therefore, the 
middle curve produces a noise for transit detections three 
times larger than the top curve, even though both curves 
have the same overall dispersion of the residuals. 

Figure 01 displays , for representative targets of th e 
OGLE transit survey 2 llldalski et alJlt^albld EooH . 12004ft . 
the behaviour of the scatter of individual points as a function 
of magnitude, and the scatter of 10-adjacent-point averages 
(10 points span about 2.5 hours, a typical transit duration 
for hot Jupiters), compared with the expected scatter of 
10-adjacent-point averages in the presence of uncorrelated 
white noise. The objects are those of the 2001 season (Galac- 
tic bulge) . The transit signals were removed beforehand and 
only objects with negligible sinusoidal modulations in the 
lightcurve were included. Figure [I] shows how the n -1 / 2 de- 
crease of the noise does not apply for most of the survey 
objects. The actual dispersion of the mean of 10 consecu- 
tive points is often much larger than (Jo/vTO, except for the 
faintest targets in the survey. For the brightest object, the 
10-point scatter is even comparable to the 1-point scatter, 
showing that red noise dominates. Since the brightest tar- 
gets are the most favourable for the detection of transiting 
planets, Fig.0]shows that the white noise assumption is not 
justified and an account of covariance must be introduced. 

2.4 Transit signal-to-noise with covariance, S r 
If we plug Eq. |3]into Eq.|2]we get: 



\J -f + TIT E^j C ij 

The r subscript is used to denote the presence of red noise. 

In the presence of red noise, Cij for i ^ j is positive and 
causes the significance of the transit to increase more slowly 
than the familiar n 1//2 from Eq. Q In the limit when ao is 
small and n is large, the significance may even be totally 
dominated by the covariance term. 

2.5 Simple model of the covariance structure 

In general, the full covariance matrix of the photometric 
data is not known, but some reasonable assumptions can 
be made about it. A satisfactory proxy to the effect of the 
covariance matrix is proposed below, and it is shown to be 
sufficient to offer a large improvement over the white-noise 
approximation both to assess the confidence level of a given 
transit candidate and to estimate the detection threshold of 
a given transit survey. 

For the orbital periods of interest in planetary tran- 
sit searches, the duration of the transits, noted I, is small 
compared to the period, P. Therefore, the data points in 
the transit will consist of a few stretches of data of dura- 
tion lower than or equal to the duration of the transit I, 

2 The OGLE survey has been up to now the most successful 
survey f or transiting planets in terms of detections, 5 confirmed 
planets iKonacki et all2003tlBouchv et all2004lPont et al.l2004t 
iBouchv et all200EtlKonacki et al.l2005l) and 177 published tran- 
sit candidates. Throughout this article we shall draw illustrations 
from the OGLE candidates. 




13 14 15 16 17 

Magnitude 



Figure 4. Standard deviation as a function of magnitude for the 
published candidates in the OGLE survey in the 2001 fields for 
individual points (filled circles) and for 10-point averages (trian- 
gles). The stars represent the expected position of the 10-point 
averages assuming pure white noise (a/y/n). The solid line is the 
expected dispersion of individual points due to the white photon 
noise, while the dashed line shows the corresponding dispersion 
for 10-point averages. For most objects the dispersion of 10-point 
averages is much higher than expected for white noise, especially 
for brighter magnitudes. The dotted line shows the expected dis- 
persion of the 10-point means according to the discussion in this 
article, with an amplitude of oy = 3.6 mmag for the red noise. 



taken multiples of P days apart. We make the plausible as- 
sumption that the covariance is a monotonically decreasing 
function of the time difference between two measurements. 
Because I « P, the covariance term for points in differ- 
ent transits will be much smaller than for points during the 
same transit. This implies that to a good approximation the 
matrix C will be block-diagonal. If Ntr is the number of tran- 
sits sampled, and rik the number of points in the kth transit, 
then the significant elements in the covariance matrix will 
consist of blocks of size by : 

same night diff nights 



Ntr fc th transit 

* E E 

fc=i 

The uncertainty of d in Eq.[3]will then be: 

N tr fc th transit 

^ = ^E E c *- ( 5 ) 

If the sampling interval is constant, then the inner sums 
in Eq.|^]are functions of nt alone. Thus, in order to calculate 
the transit significance S r in the presence of red noise, it is 
sufficient to estimate the function V defined as 
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nXn block 

without the need to fully calculate the individual CVj nor to 
make specific assumptions on the dependence of the covari- 
ance on the time separation between two points. 
For a given lightcurve Eq. |^|then becomes: 



= —2 ^ V(n k )n 2 k 



(6) 



and the detection S/N over all individual transits will be 

„2 



Si = d 2 



E":> 2 fc V(n fe ) 



(7) 



The uncertainty on the depth of a single transit will simply 
be: 



a d = V 1/2 (n) 



2.6 Estimating V(n) for a given lightcurve 



(8) 



We can estimate the function V(n) for a given transit can- 
didate using the lightcurve data points themselves, fi, by 
relating V(n) to the variance of the average of n points in a 
time interval / outside the transit signal, using the following 
procedure: 

a) remove the points in the transits, as well as any apri- 
ori known systematic effects in the signal; 

b) calculate the mean of the flux, Fj, over a sliding inter- 
val of duration I, equal to the duration of the detected tran- 
sit, recording the number of points rij in the interval. The 
interval slides along the whole time series, in steps smaller 
than the time sampling interval; 

c) group the means Fj into bins, according to the num- 
ber of points in the interval rij ; 

d) calculate the variance of Fj separately in each bin rij 
- that serves as an estimate of the V(n) function. 

This procedure has the considerable advantage of re- 
quiring no external assumption on the covariance of the 
residuals. It estimates V(n) directly from the data itself, pre- 
serving the exact covariance structure of the data as sampled 
outside the transit. For instance, photometric trends can be 
more important at higher airmass, near the edge of each ob- 
serving sequence. In that case this effect will be reflected by 
an increase in V(n) for small n (i.e. for cases when there are 
only few points in a transit-length interval, which can only 
occur at the beginning or end of the observing sequence). 

Note that for the simple case of pure white noise (i.e. a 
diagonal covariance matrix) we expect V(n) to scale as 1/n, 
while for perfectly correlated samples V(n) will be indepen- 
dent of n. 



2.7 Example from the OGLE planetary transit 
survey 

We have applied the above method to some transit candi- 
dates published by the OGLE survey. Figure |K| displays, by 
three representative examples, the behaviour of the V(n) 
function. We plot the more familiar value V 1 / 2 (correspond- 
ing to the standard deviation of the average rather than the 
variance). On this plot an uncorrelated signal is expected 
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Figure 5. The V(n) function for three representative OGLE 
lightcurves, with brightness increasing from top to bottom. V 1 / 2 
(corresponding to the standard deviation of the average of n suc- 
cessive points) is plotted as a function of n. In each panel, the 
empty diamond shows the standard deviation of individual data 
points in the lightcurve. The dotted line shows a a/-/n relation, 
as expected with uncorrelated noise. The solid line is a fit of 
V(n) = <7 2 /n + <7 2 (Eq.EJ. 



to produce a n -1 / 2 relation, while entirely correlated points 
will follow a flat relation. Figure |S] shows that in the OGLE 
photometry, sometimes V(n) evolves quite nearly as n -1 ^ 2 
as expected for white noise, while in other cases it is almost 
flat. In general, V(n) decreases less rapidly with n for faint 
objects than for bright objects. This is obviously related to 
the fact that as the photon noise becomes larger, it dom- 
inates the red noise (the "systematics" ) . For the brightest 
objects, V(n) is asymptotically constant, which is expected 
if the points were perfectly correlated. Therefore the noise is 
completely dominated by low- frequency systematics, which 
means that gathering more points in a given stretch of time 
does not add much information. 

It is interesting to note that the asymptotic lower limit, 
towards which V(n) seems to converge, is similar for all the 
objects regardless of magnitude (see also Fig.^Jand Fig.|SJ. 

Note also that V(l) is not equal to the mean variance 
of all points (indicated by empty diamonds in Fig. This 
is because V(l) is estimated only with data points that are 
isolated in a duration I, i.e. points situated at the beginning 
or end of a stretch of measurements. It is quite likely that 
these points will tend to be measured at higher airmass val- 
ues and with atmospheric conditions varying more rapidly, 
so that their scatter will be larger than average. 



2.8 Comparison of the S r statistic with the 
white-noise equivalent 

Obviously, one goal in a transit survey is to detect as many 
transiting planets as possible. In hypothesis testing jargon, 
we basically wish to reduce the probability of type-I error 
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(false negative). However, this means reducing the detection 
threshold, which in turn implies increasing the type-II error 
(false positive). The number of false positives is the practical 
bottleneck, since it bears an immediate implication on the 
amount of follow-up observations needed. We therefore have 
to fix the threshold according to the amount of false positives 
we are willing to tolerate, taking into account the size of the 
survey. 

In order to compare the performance of the S r statistic 
to that of white- noise statistic Sd, we produced two syn- 
thetic datasets: each dataset consisted of 1000 lightcurves 
with no transits, and 1000 lightcurves with transits of vary- 
ing periods, phases, depths, and noise levels. The simulated 
observations lasted for 50 nights, with a sampling interval of 
15 minutes. The two datasets differed by the way the noise 
was generated - the first dataset had a pure white noise, 
while the second had a strongly covariant noise with an ex- 
ponentially decaying correlation - p(At) = e~~ , with a 
time constant of r = 13 minutes. At the starts and ends of 
nights the correlation was artificially increased to simulate 
high airmass effects. No inter-night correlati on was intro- 
duced . We ran the BLS detection algorithm jKovacs et alJ 
2002) on the two datasets and computed Sd and S r for all 
the lightcurves. A lightcurve for which the true period was 
found by BLS (to reasonable accuracy) was tagged as a de- 
tection. By varying the detection threshold we change the 
error rates of both types. Figure |S] shows the behaviour of 
the two error types for the two datasets. The dashed curve 
represent the performance of Sd while S r is represented by 
the solid curve. In the case of pure white noise (upper panel) 
the difference between Sd and S r is marginal. However, in 
the case of the red- noise dataset (lower panel), at each level 
of type-II error (false positive) rate we can reach a consid- 
erably lower type-I error (false negative) rate using S r - 
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Figure 6. Type-I (false negative) and type-II (false positive) er- 
rors for the two simulated datasets with pure white noise (upper 
panel) and extremely red noise (lower panel). The curves are pro- 
duced by varying the detection threshold, which controls the two 
types of error rates. The dashed curve represents the performance 
of Sd and the solid curve that of S r - 



2.9 Single-parameter description of the covariance 
with ay 

To model the behaviour of V(n) in the presence of red noise, 
let us represent the red noise as a Fourier sum of sine curves. 
For a sine signal, the dispersion of the mean of a set of points 
over a duration I is a very strong function of the wavelength 
of the sine signal. Figure[7|gives this dispersion as a function 
of the wavelength. This dispersion is the amplitude of the 
systematics that will remain in the signal regardless of the 
number of points measured during an interval of duration 
I. It is seen to peak at wavelengths slightly above twice the 
duration of the transit. Schematically, the correlated noise 
can be thought of as consisting of three components: short 
frequencies that will tend to average out over the duration of 
the transit, long frequencies that will not vary between the 
transit and the neighbouring measurements, and frequencies 
around (2Z) _1 that will introduce strong transit-like residu- 
als. 

Let us assume then that the noise can be separated 
into a purely white component, noted a w , and a purely red 
component, noted ay (where w and r stand for "white" and 
"red"), expressing the power of the correlated noise at fre- 
quencies near (2Z) _1 . 

a m results from several independent white noise com- 
ponents such as photon noise, sky noise and scintillation. 
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Figure 7. Standard deviation of the average over a duration I 
for sine signals of unit dispersion and different wavelengths. The 
wavelength is expressed in units of I. 
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In general it is a function of the target magnitude, oy re- 
sults from systematics (correlated noise). Both parameters 
a w and a r can be derived from a given lightcurve from 
the behaviour of V(n). For purely white noise, the variance 
scales as V(n) = ajj,/n, while for purely red noise it scales as 
V(n) = a 2 (see Section ^. bl and Fig|5J. In intermediate cases, 
we can model it as V(n) = a^/n + a 2 , so that it is near a^/n 
for individual points and approaches of as n grows larger. 

We have calculated the values of a w and oy for pub- 
lished candidates in the case of the OGLE survey. The re- 
sults are displayed in Figure [8] The amplitude of the white 
noise has a familiar dependence on magnitude, with photon 
noise dominating except at the bright end. The red noise 
shows no dependence on magnitude. Its mean is 3.6 mmag 
for the 2001/2 season and 3.1 mmag for the 2002/3 sea- 
son. The difference is probably due to a less crowded field 
and improvements in the reduction procedure for the second 
season. 

Using published information, we infer that values near 
3 mmag for ay are typical of some other surveys as well. For 
instance, in a tr ansit survey c entere d on NGC 2301 with 
a 2.2m telescope, iTonrv et alJ 1120051) find a red noise of ~ 
3 mmag (see their figures 4 and 5) 3 . Assuming that red 
noise dominates for the brightest targets, similar values of 
oy are also inferred from the magnitude-dispersion plots and 
transit candidate depths for several other surveys cited in 
the Introduction. 



2.10 Parameter dependence of the S r statistic 

In order to estimate V(n) and S r when the actual lightcurves 
are not available, we use the expression of V(n) from the 
previous section: 



V(n) = a't/n + ay . 

We can then write: 

N tr , , 

W \ rife 

fc=i v 

and express S r as: 

where N tr is the number of transits sampled, nt is the num- 
ber of data points in the k th transit. 

To relate Eg. 1111 to the transit detection threshold, we 
express it in terms of physical parameters. For simplicity we 
assume here a homogeneous distribution of data points in 
phase, so that rife ~ n/Nt r for all transits sampled. Then 
Eq. 1111 becomes: 



0) 



(10) 



(11) 



d 1 



+ 



(12) 



Ntr 



3 Note that intrinsic variability is also inclu ded in the red noise 
in addition to systematics in the photometry. ITonrv et alJ (2005) 
find a very high occurence of variability at the millimag level. 
These have the same effect as systematics on the detection of 
transits. 




Figure 8. cr w (white symbols) and oy (black symbols) for OGLE 
candidates from the 2001/2 fields. The amplitude of oy is found 
to be of a few millimagnitudes in almost all cases. In fact, given 
the uncertainty in the determination of oy itself (involving a 
quadratic subtraction), values of a r for the different targets are 
remarkably similar. It therefore seems that to a fair approxima- 
tion, all targets in the survey can be thought of as affected by a 
similar red noise component, largely independent on magnitude. 
The mean value is oy = 3.6 mmag for the 2001/2002 targets 
(Galactic bulge) and o r =3.1 mmag for the 2002/2003 targets 
(Carina). 



If N is the total number of data points per star, and 8 t 
the typical time interval between two measurements, then 
the number of points during the transit is n = /3NR/na 
(with a = cP 2//3 Af 1/ ^ 3 ), and the mean number of transits 
sampled is Ntr = NSt/P, where a is the orbital semi-major 
axis (assuming a circular orbit), P the orbital period, M 
and R the mass and radius of the primary, c is a constant 
equal to 1 in Sun-Earth units (Mq, years, AU), and (5 is a 
correction factor due to the latitude of the transit ((3—1 
for a central transit). 

Then Eg. 1101 becomes: 

2 _ alP 2/3 M 1/3 ofP 

° d - CT Mr + m ' 



and Eq. [TT] 

S 2 r = d 2 /o 2 d = a(r/R) 4 /aj = 



a(r/R) 4 



2p2/3 M l/3 2p 



(3NR 



+ 



N5 t 



(13) 



is the detection significance in the presence of systematics, 
expressed as a function of the characteristics of the planetary 
system, r is the radius of the planet and a a parameter 
dependent on limb darkening, accounting for the fact that 
actual transit signals are a bit deeper than (r/R) 2 because 
of limb darkening: d = a(r/R) 2 . 

The limit of this expression in the uncorrelated noise 
regime (cr w /y/n » oy) is 
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Sr = (i^j V2 (r/R) 2 N 1/2 R 1/2 p- 1/3 M~ 1/6 a^ , (14) 

which corresponds to the white-noise expression Eq.|2] while 
the limit in the correlated regime is 

S r = Q 1/2 (r/i?) 2 iV 1/2 5 i 1/2 P" 1/2 a,7 1 . (15) 

Numerically, expression 1151 is very different from ex- 
pression^] For typical parameters of transit surveys (e.g. 
R = 1 R , M = 1 M Q , TV = 1000, S t = 15 min, a ~ 1, 
P = 3.5 days, r = 0.1 R , a w = 0.003, oy = 0.003), SV is 
about 3 times larger than predicted by white noise alone. 
This has profound implications for the yield prediction and 
interpretation of transit surveys. 

Eq. 1121 can be expressed in terms of its white-noise 
equivalent. A bit of algebra gives: 

S r = S w [l + (a r /a ) 2 n k ]~ 1/2 , 

where S w = d/aoy'n is the white-noise significance. When 
red noise dominates, this reduces to 

Sr — S w Tl^ ^ , 

i.e. the significance is reduced by the square root of the typ- 
ical number of points in an individual transit. For the pa- 
rameters above, n k ~ 10. Hence the result that the detection 
threshold is about three times higher for bright targets in the 
OGLE survey than indicated by the white-noise approach 
(see Section f3.H . Other surveys typically have a smaller St, 
so that the reduction factor is even larger. 

2.11 Generalisations 

In many cases the uncertainties of the individual data points 
are not constant. An additional assumption is then needed 
about the relation between the white noise and the red noise. 
A simple way to generalise our method is to assume that the 
red and white noise are proportional. This is reasonable, for 
instance, if higher uncertainties are due to a higher airmass 
or poor weather conditions, which are expected to increase 
both the white and red noise. In that case, the equivalent 
of Eq.[7|can be computed from the variance of the weighted 
mean of the flux in a transit-length interval. However, there 
may be situations when the red noise is not proportional to 
the white noise. For instance, if the exposure time changes 
by a significant factor, the photon noise will change, but 
the photometric systematics may remain unaffected. In that 
case, a possible solution is to parametrize V as a combination 
of red and white noise as in Section [2.91 and to determine 
ay on the assumption that a m is proportional to the photon 
noise. In most cases, though, it may be sufficient to assume 
a mean constant uncertainty to compute V even if the real 
uncertainties somewhat vary. 

Equation |7| can also be generalised to account for un- 
equal depths in the different transit sampled. Instead of us- 
ing the same d for all transits in Eq. [I]and[7| the S/N can 
be summed over all the transits using the individual dt, the 
mean depths of the data in individual transits. 

Some transit detection algorithms look for single tran- 
sits by searching for sharp flux changes rather than by fitting 
a step function to the phase-folded signal. Such methods can 
perform better when photometric trends within and between 
nights are strong. In that case, V(n) should be computed 



by using the variance of the mean flux difference between 
two neighbouring stretches of data (as used by the detec- 
tion algorithm) rather that the variance of the mean flux 
during the transit. This may also be a suitable approxima- 
tion when the detection is performed by visually inspecting 
the lightcurves. 



3 APPLICATIONS 

3.1 Selecting transit candidates in a survey 

The S r statistic is a significance indicator for potential tran- 
sit signals that, as discussed in Section \l. 81 is superior to the 
usual S/N statistics in the presence of covariant noise. It can 
therefore be used to estimate the significance of a transit 
candidate, i.e. the estimation of the probability that the de- 
tected signal is due to random fluctuations rather than real 
transits. 

Existing surveys have shown that transiting planets pro- 
duce signals that are near the detection threshold, and that 
it is important to include candidates that are as near as 
possible to the spurious detections. However, the presence 
of red noise makes this difficult. Most surveys have tackled 
this problem by staying well clear of the minimum threshold, 
and retaining only clear signals with high signal-to-noise for 
subsequent follow-up and confirmation. But this comes at 
the price of missing potentially interesting candidates. The 
use of the S r statistic to evaluate the significance of the 
transit detections should allow a lower detection threshold 
and better identification of spurious signals. It is also useful 
for ranking the candidates before the demanding follow-up 
efforts. 

The OGLE survey has been striving to set the detection 
threshold as low as possible, even at the risk of including a 
few spurious detections. Here we apply our formalism to the 
OGLE candidates and show that it provides a more reliable 
definition of the effective detection threshold and a better 
discrimination of false positives. 

Using the V(n) function calculated for each OGLE tran- 
sit candidate, we computed the S r indicator with Eq. |7| for 
the published OGLE transit candidates. The left panel of 
Fig. [5] displays the results for the 2002/3 observing season 
(OGLE-TR-60 to OGLE-TR-132), for w hich the radial ve - 
locity follow-up is essentially complete iPont et al.ll2005T) . 
For comparison, the right panel shows the white-noise detec- 
tion signal-to-noise Sd- Crosses indicate suspected false pos- 
itives according to the spectroscopic follow-up, and filled cir- 
cles the confirmed eclipsing binaries and transiting planets. 
The apparent threshold in the S r plot, around S r = 8, con- 
stitutes a much sharper detection criterion than Sd- Of the 
nine objects with S r < 9 among the candidates of the first 
two seasons (OGLE-TR-48, TR-55, TR-58, TR-89, TR-118, 
TR-124, TR-125, TR-127 and TR-131), seven were shown by 
the spectroscopic follow-up and furthe r photometric mea- 
surements to be likely false positives teouchv et alJl2005l : 
iPont et all2005t A. Udalski, priv. comm.). Only OGLE- TR- 
55 and OGLE- TR-125 were confirmed as bona fide eclipsing 
binaries. 

The white-noise Sd offers a much poorer separation of 
the false positives, especially for the brighter targets. As ex- 
pected, the Sr- statistic eliminates the strong dependence of 
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Figure 9. Left— S r versus magnitude for the OGLE candidates 
of the 2002/3 season. Filled circles indicate candidates confirmed 
by the radial velocity or photometric follow-up as bona fide tran- 
sits or eclipses, crosses stand for the possible false positives ac- 
cording to the spectroscopic follow-up, and squares mark the can- 
didates showing obvious sinusoidal modulations in the lightcurve. 
Right— same axes and symbols, with transit signal-to-noise cal- 
culated assuming uncorrelated noise. The dotted line shows a 
threshold signal-to-noise value of 8. 



the threshold on magnitude, since systematics become the 
dominant source of noise for the brighter targets. The bright 
candidates most likely to be false positives according to the 
radial velocity follow-up get a much lower ranking, closer to 
the detection threshold near S r = 8. At the bright end of 
the magnitude range, the significance decreases by a factor 
~ 3 for OGLE targets compared to the white-noise analy- 
sis, which is approximately the square root of the average 
number of data points in an individual transit, illustrating 
the transition from the uncorrelated regime to the heavily 
correlated regime. 

These results confirm the value of S r as a robust confi- 
dence indicator, superseding white-noise signal-to-noise in- 
dicators. It also shows that thresholding S r is a very good 
model for the combination of white signal-to-noise, spectral 
'SDE' index of the BLS and by-eye selection performed by 
the OGLE team. 

For the 2004 season (OGLE-TR-138 to TR-177, 
lUdalski et ail 12004) . the OGLE team lowered the accep- 
tance threshold for the published transiting candidates, in 
an attempt to detect more transiting planets. The spectro- 
scopic follow-up of these candidates has not been completed 
to date. It is therefore useful to apply our approach to these 
new candidates.We find that OGLE-TR-143, TR-150, TR- 
152, TR-160, TR-161, TR-162, TR-166, TR-168, TR-172, 
TR-173, and TR-174 have S r < 8 and are therefore highly 
likely to be false positives. As a consequence, the absence 
of a spectroscopic eclipsing binary signature for these ob- 
jects will not be a compelling indication that they host a 
transiting planet. 

By examining published transit candidates and infer- 
ring plausible values of a r from published information, we 
have found clear indications that the S r statistic also pro- 
vides useful results when applied to other surveys. For in- 
stance, the TrES-1 transiting planet detected by the TrES 
network has Sd ~ 35 and <SV ~ 12. Its high white-noise sig- 
nificance would make it difficult to understand why such a 
deep transit was the only planet detected by the TrES sur- 
vey - given that shallower planetary transits are expected 
to be much more abundant. However, the relatively low S r 
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Figure 10. Phase-folded lightcurve of OGLE-TR-161 as an ex- 
ample of detection with low S r (S r = 5.5). Note the structure 
outside the candidate transit signal. 



indicates that it may be situated near the actual detection 
threshold. 



3.2 Improved error estimates 

The expression of Eq.E]for ad also gives an estimate of the 
uncertainty on the depth of the transit, taking into account 
the correlation in the noise. This expression can replace un- 
certainty estimates based on white-noise x 2 statistics that 
are likely to yield underestimated error intervals. In turn, 
the ad can be used to derive the uncertainties on the stellar 
and planetary parameters. 

This method is in essence si milar to the "residual per- 
mutation" metho d that we used in lBouch v et alJ i2005t) and 
iPont et all i2005l) . earlier suggested in lJenkins et alJ (l2002fi 
under the name "segmented bootstrap" . 

This method can also be extended to compute realistic 
uncertainties on other parameters than transit depth, such 
as the duration, orbital inclination or central epoch. This 
can be done by using the V function to estimate the ex- 
pected variance between the observed data and the actual 
value, using only the points that are contributing to the de- 
termination of the parameter in question (for instance, for 
the transit epoch, the points during the transit ingress and 
egress) . The value of the parameter can be moved away from 
the best-fitting value until the sum-of-squares is equal to the 
variance expected from Eq.||j] 

We have used this method on the multi-colour, multi- 
site transit photometry of HD 189733 to show that the white- 
noise uncertainties were underestimated and that correlated 
noise could accoun t for unexplained features in the results 
jBakos et al]l200rJ) . 



3.3 Evaluating the potential of planetary transit 
surveys 

In order to evaluate the potential of a given planetary tran- 
sit survey in terms of planet detection (e.g. at the plan- 
ning stage), a model of the transit detection threshold is 
required. In previous sections we showed why a threshold in 
the S r statistic can provide a significant improvement over 
the white-noise statistic generally used for this purpose. 

3. 3. 1 Numerical value of the S r threshold 

The numerical value of the detection threshold in terms of 
S r will depend on the time sampling of the lightcurve and 
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Figure 11. Histogram of S r for 1000 simulated lightcurves with- 
out transit signals, with a time sampling of 15 minutes over 50 
nights. Solid line: red+white noise, dotted line: white noise. 

on the desired rate of false alarms considered acceptable. 
In the OGLE survey in Carina for instance, we find that 
false alarms become predominant over positive detections 
for S r < 9 (see Section fc.H . The lowest confirmed planetary 
transit has S r = 11.6, and the lowest confirmed eclipsing 
binary has S r = 8.8. In our simulations for 50 consecu- 
tive nights with white, red and entirely correlated noise, 
the threshold S r values for false positives are very rarely 
larger than 7 (see Fig. II It , although correlated lightcurves 
do produce false positives up to S r ~ 12 at rates of a few 
per thousands. The white-no ise values are consis tent with 
threshold values estimated bv lJenkins et alJ J2002f) between 
5.5 and 7.5 for surveys spanning 1 week to 4 years, using the 
white-noise analogous to S r . Published information about 
transit candidates from other surveys than OGLE also im- 
ply similar values of the threshold. 

In summary, the effective detection threshold in actual 
cases will depend on the time sampling, desired false alarm 
rate and the detection algorithm used. It will typically be in 
the 7-9 range. 

3.3.2 Behaviour of an S r threshold 

An S r threshold has a different behaviour than the white- 
noise Sd generally used. S r is not proportional to the disper- 
sion of individual points <7rj, it "saturates" before oo leaves 
the photon-noise regime towards the systematic-dominated 
regime. For values of oy in the range of a few millimag, 
this implies that most planetary transits will stay below the 
detection threshold regardless of the magnitude, with im- 
portant implications for the detection potential of transit 
surveys. 

Figure [T2l plots the detection threshold in terms of tran- 
sit depth (from Eg. I13IIT51 with a photon noise of 0.01 mag 
at the mi magnitude, a scintillation noise of 2 mmag and 
no sky background noise. The dotted line shows the equiv- 



Figure 12. Detection threshold in terms of transit depth as a 
function of magnitude, with a white noise dominated by photon- 
noise and scintillation and a red noise characterised by ay = 
3 mmag (solid line) and white noise only (dotted line) for a typical 
planetary transit signal with P = 3.5 days, mi is the magnitude 
for which the photon noise is 0.015 mag. 



alent relation if the noise is assumed to be white. Figure [T2"l 
shows that taking red noise into account makes an enor- 
mous difference in the detection threshold. The difference is 
largest for the brighter targets, which are also the most im- 
portant targets for planet detection. Since most hot Jupiters 
produce transits of depth 1% or shallower, and have peri- 
ods of 3-4 days or higher, one conclusion of Figure 1121 is 
that red noise will make transit surveys much less efficient 
in terms of planetary detection than would be indicated by 
white noise estimates. The loss of efficiency will be even 
higher than suggested by a simple comparison of the de- 
tection threshold, since the presence of red noise causes the 
detectability zone to stay constantly above the typical depth 
of hot Jupiter transits even for the brightest stars in the sur- 
vey. Therefore, detections will be confined to peculiar cases 
such as exceptionally deep transits or favourable periods (see 
Fig. I13H - precisely as observed in the characteristics of the 
seven transiting planets detected to date by transit surveys. 

The introduction of red noise in Eo, I13l has another in- 
teresting consequence: it introduces a somewhat steeper de- 
pendence of detectability on period than in the case of white 
noise. The reason is that for short periods, an equivalent 
number of data points during the transit will be distributed 
over a larger number of transits, mitigating the effect of 
the covariance. Figure [TBI shows the results of Monte Carlo 
simulations computing the proportion of detected transits 
as a function of periods, with the setup of the OGLE sur- 
vey (1100 points over about 60 nights), at the bright limit 
(oq — oy), with oy = 3.1 mmag and a threshold for detec- 
tion St = 8. The presence of red noise introduces a sharp 
decline of the detectability above a rather short period. For 
instance, for a Solar-type primary, the detectability is neg- 
ligible above P — 2 days. This contributes to explain the 
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Figure 13. Detection probability as a function of period, with 
ay = 3.1 mmag (solid curves) and oy = (dotted curves) for a 
primary radius of 1.0 R© and a survey durations of ~ 50 nights. 
The secondary radius is fixed to 0.11 Rq, <j w is set to 3 mmag. 
The detection criterion is S r > 8 and the presence of data in at 
least three different transits. 

apparent inconsistencies of the results of the OGLE survey 
(3 Very Hot Jupiters with P=l-2 days, for 2 hot Jupiters) 
compared to radial velocity surveys (more than two dozens 
hot Jupiters and no P < 2 days planet). The period depen- 
dence of detectab ility in the O GLE survey is even steeper 
than modelled bv lGaudU <l2005t) with a white- noise approxi- 
mation (see next section for quantitative estimates). 

In our simulations as well as in the OGLE data, we also 
found a residual dependence of the effective S r detection 
threshold on period. This is due to the fact that detection 
algorithms have more difficulty finding low signal-to-noise 
transits at longer periods, at a given S r level, because with 
less transits sampled there are more possible "foldings" of 
real transits with systematic fluctuations to produce false 
detections. 

3.3.3 Detection potential of transit surveys with a S r 
threshold 

The potential of a planetary transit survey will depend not 
only on the detection threshold but also on the observed 
target populations. These will be specific to each case and 
require specific simulations. Without going into the details 
of any specific survey, it is interesting here to examine the 
effect of the amplitude of the red noise, expressed by a r , 
on the potential of a transit survey with typical parameters. 
We have run a simulation for a "typical" survey consisting of 
50 nights of observations of 10000 targets with photon noise 
lower than 0.01 mag. We assumed that 1% of the targets 
have a hot Jupiter, with a log-flat distribution of periods 
between 3 and 10 days, that half of the targets are binary 
or blended, and used star/planet radius ratios drawn from 
a log- normal distribution centered on —1.15 with dispersion 
0.18, and a sampling interval of 5 minutes. 

Effect of systematics on the total yield 
Figure [TTI shows the results in terms of expected number of 
transiting hot Jupiter detections as a function of the value of 
the r parameter, assumed to be identical for all targets. It 
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Figure 14. Number of transiting hot Jupiters (top curve) and 
hot Neptunes (bottom curve) found in our "typical" ground-based 
transit survey simulation, as a function of the oy red-noise pa- 
rameter. Hot Neptunes are assumed here to be ten times more 
abundant than hot Jupiters, and to have radii around 0.3R,/. 

illustrates the drastic effect of the amplitude of ay on the po- 
tential of ground-based surveys. For values of 2-3 millimag, 
the expected yield is reduced by a factor 2 to 4 compared to 
the white-noise expression. Photometry with red noise above 
a few millimag cannot be expected to provide a significant 
contribution to the detection of transiting planets. 

Hot Jupiters vs. Very Hot Jupiters 
We examined the effect of changing the period distribution 
of the transiting planets. We considered the case of "very hot 
Jupiters", with periods between 1.2 and 2 days. Assuming 
white noise, such planets are more easily detected, because 
their transits occur more frequently and also because the ge- 
ometric pro bability of a transit configuration is higher (see 
Gaudi 2005). We find that with ay =3 mmag, the detectabil- 
ity of very hot Jupiters compared to 3-4 days hot Jupiters is 
further increased by a factor ~ 2, and by a factor ~ 3 with 
a r = 5 mmag. This helps to explain the marginal mismatch 
between the period distributions of hot Jupiters detected 
with radial velocities and with photometric transit surveys. 

Total survey duration 
We also considered the effect of modifying the total dura- 
tion of the transit survey. The simulations indicate that the 
presence of red noise lengthens the duration of the survey 
necessary before the number of detections starts to saturate 
(see Fig. 1151 . With purely white noise, the number of de- 
tections starts to saturate after about 50 nights, but with 
a r =3-5 mmag it keeps rising almost linearly beyond 100 
nights. This indicates that correlated noise moves the opti- 
mal length of a transit survey towards longer campaigns on 
the same target fields. Whereas white-noise estimates may 
indicate that 30 — 50 nights of measurements on a field be- 
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Figure 15. Expected number of transiting hot Jupiter detec- 
tions in our "typical" survey as a function of the total number 
of nights of observation, for purely white noise (dotted line), and 
two different levels of red noise: oy = 3 mmag (solid line) and 
ay = 5 mmag (dashed line). 



4 SUMMARY AND IMPLICATIONS 

This study shows that in ground-based surveys for planetary 
transits, the systematics in the photometry have a very im- 
portant effect on the capacity to detect planetary transits, 
and that estimates of detection significance and detection 
threshold based on the assumption of white Gaussian noise 
are not appropriate. We propose a simple formalism to as- 
sess the significance level of detected transits, and to predict 
the detection threshold of an observing campaign. 

We find that the uncertainty on the depth d of a transit 
candidate is 



2 

0d 



nr — ' 



(16) 



where N tr is the number of transits sampled, n k the number 
of data points in the k th transit. V is a function that can be 
calculated through the curve itself from the variance of the 
average flux over transit-length intervals outside the transit. 

Consequently the significance (signal-to-noise) of a 
transit signal candidate in the presence of real photomet- 
ric noise can be calculated by: 



(17) 



Introducing the a r factor to parametrize the red noise, 
the threshold for detection is: 



> s4 



fore moving to another one is the optimal strategy, taking 
red noise into account moves the optimum towards larger 
campaigns. The basic reason behind this result is that, be- 
cause of the trends in the photometry, a higher number of 
individual transits may be required to achieve the necessary 
signal-to-noise for a secure detection. For instance, for the 
five detected OGLE transiting planets the number of indi- 
vidual transits sampled is 7, 12, 9, fO and f 1 - whereas sim- 
ulations indicate that most detections should be based on 
3-5 transits. The expected yield of a survey therefore keeps 
rising until the most common hot Jupiters, with periods of 
3-4 days, can be sampled enough times. 



Hot Neptunes 

Several claims have been made about the capacity of ground- 
based surveys with big telescopes to detect "Hot Neptunes" 
i.e. close-in planets with sizes around 0.3 Rj, all based on 
the white-noise a pproximation (e.g. lHartman et al J 120051: 
iGillon et al.ll2005h . To examine what value of ov such detec- 
tions would require, we repeated the same simulation with 
planets of Neptune size (~ 0.3 Rj). We assumed that such 
planets are ten times more abundant than hot Jupiters. Fig- 
ure 1141 shows that extremely small values of oy would be 
required to attain a significant efficiency in the detection of 
Hot Neptunes, oy < 0.3 mmag. This would be exceedingly 
difficult to achieve in wide-field photometry over long time 
scales. Therefore the photometric detection of transiting Hot 
Neptunes from the ground requires an unlikely leap in the 
treatment of systematic effects in the photometry. 



where a w is the uncorrelated (white) noise in the photom- 
etry. Typical values of oy are of a few millimagnitudes (3 
mmag in the OGLE survey), and typical values of St are 9 
or higher in realistic settings. 

In terms of physical parameters the threshold is: 



Sr = a{r/Rf 
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Our approa ch be ars si milarities with that 
of iJenkins et alJ (120021) . iKovacs et all J20021) and 
ISirko fc Paczvriskil (I2003I) . in accounting for the sys- 
tematics in the noise by studying the properties of the 
photometric time series over longer time intervals. We tried 
to devise a method that is easy to apply, does not require 
long computation, and can be extrapolated to ongoing or 
future surveys with minimal extra assumptions. 

In general, accounting for the effect of systematics re- 
sults in much lower yield estimates for ground-based transit 
surveys. We believe that these estimates are more realistic 
than white-noise estimates. They are also in much better 
agreement with the low actual detection rate. These more 
realistic estimates can help in the design of transit surveys. 
Including the red noise modifies the dependence of the yield 
estimates on the survey parameters. Thus, survey design can 
be affected not only quantitatively by lowering the predicted 
yields, but also qualitatively. The presence of systematics 
has several important effects: (1) it reduces the premium 
placed on the brightest end of a survey for detection, (2) it 
steepens the period dependence of detectability and (3) it 
reduces the importance of denser time sampling. 
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These elements may for instance weigh a survey strat- 
egy towards including more fields with a longer separation 
between individual measurements, and spending more time 
on a given field. 

In the case of surveys targeting stell ar systems, red noise 
modifies the conclusions o f lGauditeOOEh that if the detection 
threshold reaches a hot Jupiter transit in front of a target 
of any magnitude, such transits will be detected for targets 
of all magnitudes. Red noise will hamper the detection for 
the brightest magnitudes. 

Our study also underlines the crucial importance of 
beating down the hour-timescale systematics in the photom- 
etry of transit surveys. This, of course, is well realized by 
all groups conducting such surveys, and considerable effort 
has been invested in removing the systematics. For instance, 
for the OGLE survey, two schemes have been developed to 
remove the sys tematics by using the correlation among all 
the li ghtcurves jKruszewski fc Semeniukl2003UTamuz et alJ 
120051) . 

However, for a given facility there may be a lower limit 
below which it will be practically impossible to reduce ay. 
Since transit surveys need as large a time coverage as they 
can get, they are compelled to use all clear nights whether 
perfectly photometric or not. At the millimagnitude level, 
interactions between airmass, colour, absorption, tracking, 
flatfielding and seeing all have the potential to introduce 
systematic trends. For instance, systematic residuals can be 
caused by the interaction of seeing with a close unseen com- 
panion, or by the tracking drift moving part of an object 
across pixels with Hatfield errors. 

Sub- millimagnitude a ccuracy has been report ed in some 
occasions, for instance bv lHartman et all (|2pQ5:). This can 
be attained with a very good sampling of the PSF on a 
large number of pixels, as possible with a large telescope on 
small fields. However, the reported results concern only a 
few nights, probably of above-average quality. It remains to 
be seen if such accuracy can be maintained over a period of 
time long enough for an efficient planetary transit survey. 
Moreover, the use of a large telescope on a small field im- 
plies that the detected transit candidates will be too faint 
for spectroscopic confirmation, limiting their usefulness. As 
shown by the existing surveys, most transit candidates are 
actually eclipsing binaries, so that spectroscopic follow-up 
is indispensable in order to confirm the planetary nature of 
the transiting companion. 

An obvious solution to the problem of photometric sys- 
tematics is to move into space, as was indeed done soon 
after the di scovery of 51 Peg wit h HST in the globular clus- 
ter 47Tuc iGilliland et alJl200d) . More recently, Sahu et al. 
(programme GO-9750) have carried out an HST survey for 
transits in the Galactic plane. The problem, however, is that 
the detected transit candidates are again too faint for spec- 
troscopic follow-up, so that planets cannot be told apart 
from stellar eclipses. 

In the coming years, two space-born planetary transit 
missions are scheduled, CoRoT and Kepler. They will be free 
from systematics caused by the atmosphere. The approach 
of this paper, however, also applies to space-born data. The 
transit detection threshold for CoRoT and Kepler will also 
be set by the characteristics of the red noise, both instru- 
mental and due to stellar variability, because transit-length 
intervals will be sufficient for the white noise to average out. 



The most relevant factor will be the stability of the hour- 
timescale average of the measured fluxes. 

In summary, this work offers a practical way, both con- 
ceptually and computationally, to empirically deal with the 
complex problem of covariant noise in extrasolar planetary 
transit surveys, both ground-based and space-born. 
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